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^^BSTRACT 

A Study was conducted that used three formats for 
spellina practice, two multiple-choice or recognition conditions 
based on the Simcn model and one production or recall condition. (The 
Simon aenere.te-and-best model of spelling suggests that spellings can 
he learned from reading or ylsucl exposure to the correct spelling.) 
The multiple-choice formats contained distractors that were either 
nlausible araphemic variations of the target word or misspellings of 
different words. Fifth and sixth grade students learned 15 words 
whose spellings they had missed on a pretest in a small group 
instructional setting in which words had been practiced for either 
three or six trials. Students were blocked into high and low ability 
groups and the effects of variations in training, degree of learning, 
and ability were assessed on an immediate and delayed 
generate-and-tes t measure and on a delayed recognition test. Results 
showed that all training aroups learned more spelling than did the 
control group. However, the recall condition was superior to each 
recognition condition on each test. High ability students 
outperformed low ability students on all three tests, (Author/MKW) 
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THE EFFECTS OF THREE TYPES OF PRACTICE FOR>/LATS 
AND TWO DEGREES OF LEARNING ON THE SPELLING 
PERFORlvlANCE OF ELEMENTARY SCHOOL STUDENTS 



Murray D. Thompson and Karen K, Block 

Learning Research and Development Center 
University of Pittsburgh 

According to the Simon and Simon (1973) theory ot spelling per- 
formance, some words {those that are partially learned) are spelled 
through the use of visual recognition ii^ormation that has been acquired 
through reading. Spelling is considered to be a generate-and-test process 
in which knowledge of phoneme -grapheme correspondences is used to 
generate trial .'spellings which are written then tested against the stored 
word recognition information that is retrieved upon viewing an approxi- 
mate spelling (Simon & Simon, 1973, p. 130). The major empirical 
support for the gener-ate-and-test account of spelling comes from the 
serial position effect found for spelling errors (Kooi, Schutz^ &c Baker^ 
1965)* The notion is that since recognition information is most cle^.r 
at the start and er d of words, and fuzzy in the middle, incorrect 
spellings will pass the test when errors exist in the middle positions 
rather than at the start or end. Therefore, the majority of spelling 
errors should be in the middle of the word, hence the serial position 
effect. 

There are two difficulties with the above argximent. f irst, the 
serial position effec? may be a phenomenon of recall rather than recog- 
nition* That is, it may simply be due to heightened recall of the start 
and ending of words. When generating a spelling of a word they have 
seen before, students may be able to recall (and subsequently write) 
the start and ends of it easily and correctly while having difficxilty 
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remembering the middle. This latter suggestion does not require the 
positing of a post-generation recognition '*test** phase- The second 
difficiilty with the argument is that tiie spelling error data cited in 
support of the model were not collected under conditions where there 
were assurances that proofreading was a regular ~»courrence. A 
requirement of the model (see p. 130) is that the test phase must be 
initiated by presentation of the spei^ling of a word; however, it is \xn- 
reasonable to assume students . subject each spelling they havt- generated 
to a "test, ^' Indeed, classroom spelling teachers must regularly remind 
their students to check the spellings they have generated^ 

The distinction between the above two accoiants is important both 
for '•understanding spelling processes and also because the two accounts 
seem to hav^ quite different implications for instruction in spelling. 
The Simons' (1973) suggest that much of spelling can hL .'vrarned by 
reading because by reading the v/ords one needs to spell, one builds up 
recognition information^ thereby facilitating spelling performance. 
They argue for the importance of correlated vocabularies in reading 
and spelling. This in turn implies that standard forms of teac) ing 
spelling by requiring production and giving feedback, derived from 
viewing spelling as fundamentally a recall rather than recognition pro- 
cess, may be less useful than we otherwise might think. The Simons' 
suggestion that spelling competence can be strongly elevated through 
'"readirti?^* (i. e. , visual exposure to correct spellings of words) is in 
need of empirical test. This study provides data *co assess the relative 
amoiTjnt of achievement that results from multiple -choice (recognition) 
training and constructed response (recall) training. 

It was hypothesized that if practice in re<:ognizing words resulted 
in greater spelling achievement, then recognition must be considered 
a fundamental spelling process. By contrast, iJf practice in recalling 
words resulted in superior achievement, then spelling must be consid- 
ered f\xndamentally a recall rather than recognition phenomenon. A 



multiple -choice format was used to provide practice in "recognising" 
the correct spelling; a constructed response format was used to pro- 
vide practice in ^'recalling^^ the correct spelling* We refer to the two 
types of training procedures as "recognition" and "recall*' because the 
tasks involved are identical to the traditional methods for testing 
recognition and recall. Our names for the training conditions are 
based on external conditions for learning rather than internal ones. 
Each training format may elicit in some students both recognition and 
recall processes because the training tasks permit only limited con- 
trol over spelling processes. 

Two recognition training procedures were developed on the basis 
of hypotheses suggested by Simon and Simon (1973), They suggested 
that spellers can increase spelling proficiency by developing more 
elaborated templates, tlirough visual exporure to the words. Two forms 
of recognition training provided such exposure while varying in the 
nature of the distractors used in multiple -choice tasks. In the high 
recognition condition, the distractors were misspellings of the target 
word, many of which were the most frequent misspelling committed on 
the pretest or other misspellings formed by substituting alternative 
plausible graphemes for selected phonemes in the word. By contrast, 
tlie low recognition condition used misspellings of completely different 
words as distractors. Thus, } :h recognition training conditions pro- 
vided the requisite visual spelling information, but differed in the 
nature of misspelled distractors. It was predicted that high recognition 
training would produce better spelling performance than low recognition 
training. This prediction was based on the assumption that requiring 
a choice among plausible alternative spellings wouJd facilitate the 
learning of the correct spelling, perhaps by forcing attention to relevant 
orthographic features. 

The third practice condition (recall) was a standard method of 
spelling practice. Students attempted to spell a word and were then 
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shown the correct spelling* It was hypothesized that if recognition 
practice was in fact a superior form of spelling practice, more spel- 
lings should be acquired in both recognition conditions than in the 
recall condition. 

In addition to variations in training procedure, the effects of two 
other variables were assessed in this study. Training was carried to 
three or six trials, hence degree of learning was varied. It was ex- 
pected that six practice trials would produce more learning thai? three. 
Sttidents were blocked into high and low ability spellers to assess any 
interactions that ability might have with treatment* 



Ivlethod 

Subjects 

The subjects were 135 fifth- and sixth-grade students from a sub- 
urban elementary school. They were chosen from a larger population 
of 165 stud'^nts on the basis of pretest performance and the spelling 
subtest of the Stanford Achievement Test- 
Students were eligible for the study if they incorrectly spelled at 
least 85% of the words on a 60-item pretest and misspelled all 15 words 
that were used as lea-ning items in the study. 

The median score (obtained by students qualifying for the experi- 
ment) on the spelling subtest of the Stanford Achievement Test was 
used to identify high and low ability spellers. The range or scores in 
grade equivalents was 3- 8 to 7. 2 and the median was 5. 7. Those above 
the median were considered high ability spellers and, inversely, those 
below the median were considered low ability spellers. 

De s ig ^n 

An equal number of fifth and six*:h graders were used. Subjects 
were randomly allocated to the exparimental conciltions such that an 
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eq\ia.l number of fifth-grade, s Ijrth-grade, high ability, and low ability 
spellers were ir* each training condition. Three training conditions 
were used in the study: h\gh recognition, low r t:cognition, and recall. 
Degree of learning (three versus six trials) varied factorially with 
training. Subjects were assigned to groups so that five low and five 
high ability, and five fifth and five sixth graders appeared in each 
combination of training conditions with degree of learning condition* 
There was a total of 120 students included in the main design which 
had four between factors {grade, ability, training, degree of learning) 
and one within factor (tests). A conhrol group that received no training 
was also rim; it consisted of seven lo-^- ability and eight higi* ability 
fifth and sixth graders. 

Apparatus 

An opaque projector and standard movie screen were used to 
present stimulus items and feedback* 

Word Materials 

Sixty spelling words difficult for sixth graders were selected from 
the Iowa Spelling Scale (Greene, 1954) and used in a pretest. The 15 
most frequently nnissed spellings in the pretest were used as the 
learning items for the study. 

Miiltiple- choice items with four distractors were developed fcr 
each of the 15 learriing items for the low recognition and high recog- 
nition condition,-;. In the high recognition condition, four highly similar 
spellings of the target words were used, only one of whicn was the 
correct spelling. The three distractor items for a target word were 
the most frequent misspelling that occurred on the pretest and two 
graphemic v^.riant3 (e.g., forein, foriegn, forien for foreign ). Simi- 
larly, the low recognition condition had four spellings from which to 
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choose* However, the distractor items were misspellings of complete- 
ly different words. The distractors varied in number of pronounceable 
chunks and no single pronounceable chunk found in the target word was 
contained in any distractor. An example set of choices ^or fox*eign 
was: restrant (2 chtmks), agrecullure (3 chunks), and hijean {2 chunks). 

iProcedur e 

A dictated pretest consisting of 60 spelling words from the New 
Iowa Spelling Scale fGreene, 1954) was administered to all fifth- and 
sixth-grade students. Training began seven days after pretesting. All 
training was conducted in a classroom setting. Twenty students were 
exposed to training simultaneously. These students were assigned to 
the same training and degree of learning conditions^ but were from 
different grade levels. Hence, 10 fifth and 10 sixth graders were tested 
i: each session. Two students, who did not qualify for the experiment, 
aided the experimenter. These students were used as rvmners to collect 
subjects for training and return them to their respective classrooms at 
the completion of training. Through the cooperation of the classroom 
teachers, students were not exposed to spelling instruction, oth/ir than 
that associated with experimental training, for the three-week duration 
of the experiment. 

In the recognition conditions, the experimenter projected a multiple- 
choice array of four spellings on a screen, pronounced the target word, 
used it in a sentence, and pronounced it again. Students wrote the 
letter (a,b, c,d) of the spelling they thought correct. The experimenter 
then showed the correct letiier and spelling of the word. Students checked 
their answers marking a '*c'» for correct or a for incorrect. Training 

continued in this way for all 15 target words. The list was recycled in 
the same fashion for either three or six trials. The order of word 
presentation and the position of the correct spelling and distractors for 
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each item was randoraized on each presentation. The conduct of recall 
training was similar, except students wrote a complete spelling then 
were shown the corre t spelling and had to check it "c'' or Degree 
of learning (three or six trials) was also varied in recall. Students in 
the control group received no training, only the tests. 

Six classroom training sessions were conducted in sequence in the 
same room on the same day. At the end of each training session, that 
is, <it the end of three or six times through the list, a "generate-and- 
test** posttest was administered to each training group. Each of the 15 
words was, as above, pronounced and used in a sentence, and pronoixnced 
again. The subjects had the option to write either one or two spellings 
for each item. If two spellings were written, the subjects xmderlined 
the spelling they thought more correct (which actually could be correct 
or not correct). Ten days later, two retention tests were given to all 
groups, including the control group, on a single day. One test followed 
the generate-and-test format used with the immediate posttest. '^he 
other test was a multiple -choice test. Distractors on the miiltiple- 
choice test were the mof t frequent misspelling, a misspelling of a 
completely different word, and a graphemic variant of the target word. 
Order of administration of the retention tests was counterbalanced. No 
feedback was given ujitil both tests were completed. 

Results and Discussion 

Control versus Training ^ 

To determine whether the various training procedures produced 
learning of the words, the training groups were compared to the control 
group. Scores on the retention test were selected for analysis because 
retention is the practically significant measure of the effects of training. 
A one-way analysis of variance computed on the ntimber of correct 
spellings produced in the training and control conditions on the generate- 
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and-test retention test showed a significant main effect (F =15* 97; 
di - 3, 131; ^ < •01). The mean number correct was 6. 73 in the recall 
condition; 3.21 in high recognition; 3. 10 in lov.- recognition; and .33 
in the control group. A Scheff e post hoc analysis showed all three 
training groups proauced significantly morr? < ♦01) correct spellings 
than the control. Clearly, all three training conditions produced learning 
of spelling wordf3. T his means that aprllings are indeed learned from 
'*reading-like^' situatii>nc 

Analysis of Accurac y 

A second analysis determined the effects of the variables in the 
main design. The effects of ?:raining, degree of learning, and ability 
were assessed tlirough an analysis of variance on the number of correct 
spellings produced on each of the three tests, immediate and delayed 
generate-and-test tests and the delayed multiple-choice test. On the 
generate-and-test tests, students wrote ono or two sj^ellings for a word, 
then underllaed the one they tlriought correct. The underlined spelling 
was scored correct or incorrect. Total correct wa9 computed by adding 
correct single spellings and co^ rect underlined spellings. Number 
correct on the mioltiple- choice teut was the number of correctly selected 
spellings. The resizlts of a repeated measures aiialysis of variance 
showed significant main effects for ability (F = 69*49; df = 1, 103; 
2_ < - 001), training {F = 37. 30; df ^- 2. 108;^ < .001), tesfes (F = 
Z56. 19; df = 2,216; ^ <.001), and one significant interaction between 
trainijng and tests (F =^ 18. 48; df = 4, 216; < - 001). High ability 
spellers (mean total correct over all tests = 8. 1) outperformed low 
ability spellers (mean total correct over all tests = 4. 3). Degree of 
learning had no effect taken alone or in combination with the other 
variables. Six '^massed'* trials had the same effect as three ''massed" 
training trials in producing correct spellings. 
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Table 1 displays the means for each combination of training and 
test conditions. 



Table 1 

Mean Number Correct for Each Training and Test Conditicn 



Training 



TeiTS 



High 
Recognition 



Low 
Recognition 



Recal 



Total 



Generate*and-test posttest 4.00 

Generaie-and-test retention test 3.28 

Recognition retention test 7.82 

Total 5.02 



2.38 
3.10 
8.45 

4.64 



9.30 
6.73 
10.90 

7.97 



5.22 
4.37 
9.06 



As can be seen from Table 1, scores were higher on the recog- 
nition retention test relative to the generate-and-test tests and the 
recall condition produced the most learning relative to the other two 
training conditions. The locus of the interaction effect was determined 
by a simple effects analysis and dcheffe post hoc comparisons. A 
siitiple efferts analysis showed there were significant differences between 
ail three training conditions at each level of test {F = 64, 68, df = 2,45 
for the generate-and-test posttest; F = 20. 61, df = 2,45 for the generate- 
and-test retention test; F = 13.03, df_= 2,45 for the multiple -choice 
test; all < . 001). Significant effects were also found among the tests 
at each level of training condition {F = 81. 79, df = 2,216 for high recog- 
nition; F = 150. 65, df = 2, 216 for low recognition; and F = 60. 72, df = 
2,216 for recall; all < .001). Scheff^ comparisons between the number 
correct in the three training conditions for each test showed recall 
training produced significantly more (g^ < ,01) correct spellings than 
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high, or low, recognition training. The two recognition training cDn- 
ditions did not differ on 3.riy test. The superiority of constructed re- 
sponse training over miiltiple-choice training therefore held for all 
three tests. The S<-^Heffe procedure also revealed perfornaance in each 
of the three training conditic was the highest on the multiple -choice 
test, significantly higher (£ < .01) than on either generate-and-tes t. 
This resxilt is consistent with standard findings of superiority for rec- 
ognition over recall <see also HoUingv-orth' s [1918] stage model). The 
locus of the test effect within training conditions was a significant loss 
of information from immediate to delayed gene rate-and-test tests in 
tile recall condition, while no significant loss was observed for either 
recognition condition. This could be a ^'bottoming out*^ effect since the 
amount Initially acquired in the recognition conditions was relatively 
small in the first place. 



Multiple versus Single Production 

Another analysis was done to determine whether variation in training 
conditions influenced the degree to "-^hich multiple as contrasted to single 
spellings were produced on the tests. This could be construed as a 
rough index of the amount of effort necessary to produce the spellings 
requested on the tests to satisfy personal levels of confidence in a 
response. Table 2 shows for each training condition tlie words for which 
alternative spellings were generated out of a possible maximum of 15. 
The means were calculated by adding the number of words each student 
wrote multiple spellings for, then dividing by the total n\imber of students. 

It is clear from the means in Table 2 that multiple spellings were 
not very f requentp except perhaps in the low recognition condition on 
the delayed test when an average of about tliree words required multiple 
spellings. Inspection of the means in Table 2 shows that in the case of 
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xnoBt practical importance, the generate-and-test retention test, the 
low recognition condition required the most effort, more than high 
recognition and recall, when effort is measured in terms of frequency 
of multiple spellings* 



Table 2 

Mean Number of Alternatives and Standard Deviation as a Function of Practice 

Condition and Tests Administered 



Tests Administered 



High 
Recognition 



Low 
Recognition 



Mean 



SD 



Mean 



SD 



Recall 



Mean 



SD 



Generate-and-test post test 1.78 1.6 

Generate-and-test retention test 1.50 1.6 



.4 

3.3? 



.9 

2,6 



.78 

1.9 



1.0 
2.0 



One might also be concerned witli the relative effectiveness of the 
treatments had only single spellings been permitted on the tests (the 
usual condition for spelling teats) and with the amount of gain (or loss) 
contributed by the opportunity to do multiple spellings. 

Table 3 shows the mean correct for each training condition and 
each teat when (a) all words are considered and the underlined spelling 
scored correct or incorrect, or (h) only the words having single spel- 
lings are considered. Clearly, the pattern of superiority among train- 
ing conditions is the same when all spellings are considered (total) or 
when only those without alternatives are considered* Furthermore, 
the means correct with, or without, alternatives are very close, 
indicating that the opportxmity to do multiple spellings contributed 
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little to the overall mean correct*. At least as far as group means are 
concerned, the chance to do multiple spellings contributed little to a 
gain in correct spellings. tests^ comparing mean correct with and 
without alternatives revealed no significant gains, nor are there practi 
cal gains due to the use of alternatives-. 



Table 3 

M^an Number of Correct Spellings and Standard Deviations When Correci Spellings That Used 
Multiple Alternatives Are Removed and the Means Are Calculated Without Alternatives 

Test Administered 



Low recognition 



Hjgh recognitton 



Recali 



Posttesi Retention Test 



Training Conditions Spelling Correct Mean SD Mean SD 



Total 2 38 1.9 3.10 2.0 

Without recognition 2.35 ^ 9 2.75 2.0 

Total 4.00 2.6 3.28 2.6 

Without alternatives 3.72 2.6 3.00 2.7 

Total 9.30 3.3 6.73 3.8 

Without alternatives 8.92 3J 6.12 3.4 



General Discussion 

The data of the present study showed that spelling can indeed be 
learned froin ''reading -like»» situations, but subsequent achievement 
is Inferior to constructed response training. The assertion of the 
Simon model that spellings of words can be learned through visual 
exposure to the correct spelling is confirmed by these data. How- 
ever, there is a clear advantage to practice in producing the correct 
spelling relative to simply selecting it from an array. Thus, while 
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recognition plays a role in spelling, it is best viewed as a secondary 
and/or supplementary process rather than a prinnary process. If the 
learning objective is to increase a child's spelling vocabulary, then 
drills using constructed responses are effective procedures to use. 

Our data also showed no difference in amount of achievement be- 
tween high and low recognition conditions. This finding seems to 
imply that the set of distractors did not function in the manner intended, 
that is, the '*cloee in" distractorL^ did not force more attention to the 
orthographic composition of the word, which in turn would presumably 
lead to greater achievement* It would seem, therefore, that the com- 
position of the set of distractors would be unimportant, ae long as they 
consisted of other misspelled words. Our feeling, however, is that 
such a conclusion is unwarrajated because the data revealed a possible 
disadvantage to the use of "far out" distractors. The mean number of 
alternative spellings generated on the generate -and-test retention test 
tended to be greater in the low recognition condition. If this finding 
were replicated in a situation in which children were accustomed to 
generating alternative spellings on a regular basis, as a part of spel- 
ling procedure, it could possibly indicate that retrieval of the recogni- 
tion template must be strongly cued by the presence of more than one 
alternative spelling. Thus, students in the low recognition condition 
are less confident in recognizing the correct production. Such a sug- 
gestion ought to be pursued in future research. 

Pinally, our data did not show evidence of a practice effect, for 
three "massed" trials were equivalent in amovmt of learning produced 
to six "massed" trials. Possibly, students* motivation for learning 
had decreased by the later trials, thereby decreasing the amount of 
learning acquired in the later trials. A graph of the course of acqui- 
sition collapsing across treatment groups showed that the number cor- 
rect steadily increased from trials 1 to 3 with an asymptote reached 
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at trail 4» Thus, three trials are the limit for deriving benefits from 
massed practice. The data underscore the importance of distributed 
practice in spelling and when more than three trials are to be spent 
learning spelling lists, spacing should begin after trial 3, 
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